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Abstract We discuss microscopic mechanisms of complex network growth, 
with the special emphasis of how these mechanisms can be evaluated from 
the measurements on real networks. As an example we consider the network 
of citations to scientific papers. Contrary to common belief that its growth 
is determined by the linear preferential attachment, our microscopic mea- 
O ' surements show that it is driven by the nonlinear autocatalytic growth. This 

O I invalidates the scale-free hypothesis for the citation network. The nonlin- 

earity is responsible for a dramatic dynamical phase transition: while the 
citation lifetime of majority of papers is 6-10 years, the highly-cited papers 
have practically infinite lifetime. 
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\^ • 1 General introduction 

A lot of empirical evidence for the power-law degree distribution in natural 
ps) I networks has been amassed during last decade. This led to the conjecture that 

■ these networks are scale-free. It is widely believed that the growth of the scale- 

rs ' free networks is driven by the cumulative advantage mechanism [T] which 

is commonly known as the preferential attachment _2 , 3 . This mechanism 
assumes that Ak, the number of links acquired by a node during a short 
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time interval At is determined by the number of already acquired links k, 

Ak = A{k + ko) (1) 

Here, k is the node degree, fco is the "initial attractivity" and A is the attach- 
ment rate (aging function) which is time-dependent. Equation [T] yields the 
power-law degree distribution, P{k) cx 1/k'^ , which is generally considered as 
a fingerprint of a scale-free network. The linear preferential attachment (Eq. 
[T|) is believed to be one of the most important microscopic mechanisms that 
generates the scale-free complex networks which are so ubiquitous in nature. 

This statement is often reversed and the power-law degree distribution 
in a growing network is considered as an evidence for the linear preferential 
attachment. The parameter ko is estimated from the exponent of the degree 
distribution [5]: 

7-2+^. (2) 
m 

where m is the mean degree. This approach meets several difficulties. First 
of all it yields unrealistically high fcg ~ m. Second and most important- the 
validity of the power-law approximation for degree distribution in complex 
networks has been contested. Indeed, since the node degree is a discrete 
and non-negative number, the scale-free power-law function cannot provide 
a good fit for the nodes with small degree. At best, it can fit only the fat tail 
of the distribution. However, several recent studies showed that the degree 
distribution in complex networks can deviate from the power-law dependence 
even in the fat tail [1[51I1[71IH] . 

Krapivsky and Redner [9] showed that the deviation necessarily occurs if 
the attachment kernel is nonlinear, 

Ak = A{k + ko)" (3) 

In particular, for sublinear attachment kernel, a < 1 the network is char- 
acterized by the stretched exponential degree distribution; while for the su- 
perlinear kernel, a > 1, the network organizes into a "winner takes all" 
configuration [MTU] . While for linear attachment kernel the network achieves 
stationary degree distribution, for the nonlinear case the degree distribution 
is nonstationary. In the sequel we call the dynamics governed by Eq. [3]as the 
"nonlinear autocatalytic growth" [m[T2| and reserve the term "preferential 
attachment" for Eq. [T] that generates the power-law degree distribution. 

To what extent the growth mechanism of real networks deviates (Eq. [1]) 
is an important question. Recent experimental studies [H [T^[ni[T51ll61ll71ll8l 
fT9l[20l[2Tll22U23l[24] that measured microscopic growth of complex networks, 
came up with the conclusion that the attachment exponent a is close to unity, 
in such a way that the growth mechanism is nearly linear (see Table I) and 
thus follows Eq[TJ However, to which extent a deviates from unity remained 
an open question until now. The above studies could hardly measure this 
deviation due to time-dependence of a, finite precision limited by the size of 
their databases and, most important - due to uncertainty arising from the 
use of different methodologies. In particular, Ref. [H] applied four different 
methods to measure attachment kernel in the network of the US patent-to- 
patent citations and found different exponents ranging from 1.12 to 1.38. 
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Table 1 Testing preferential attachment in real networks 



Network 


Ref. 


Attachment 
exponent a 


Method 


Citations 


Jeong et al. [T51 


0.95 


cumulation 


of scientific papers 


Eom &Fortunato [15] 


1 


cumulation 




Redner [3] 


0.9-1.05 


running average 




Wang et al. QSi 


1 


running average 




Golosovsky &SoIomon |28) 


1-1.25 

{grows with time) 


histogram 


Citations 


Csardi et al. [17| 


1.2 


histogram 


of US patents 


Sheridan et al. 1241 


1.23-1.27 


Metropolis-Hastings 




Valverde et al. [16j 


1-1.25 

{grows with time) 


cumulation 


Scientific 


Jeong et al. [13] 


0.8 


cumulation 


collaboration 


Tomassini &Luthi [19] 


0.76 


cumulation 




Newman [^ 


1 


histogram 


Movie actors 


Jeong et al. I13| 


1 


cumulation 




Eom et al. [14) 


1 


cumulation 


Wikipedia 


Capocci et al. [21| 


0.76 


histogram 


Google 


Jeong et al. I13| 


1.05 


cumulation 


Internet 


Eom et al. [14| 


1 


cumulation 


Internet Dictionary 


Herdagdelen et al. [22| 


1 


histogram 


Protein networks 


Eisenberg fcLevanon |23j 


1 


cumulation 



The goal of our present study is the high precision measurement of the 
microscopic growth rate of a complex network and the determination of the 
attachment exponent a. Following the accepted practice [H IT^ITSIITB] . as an 
object of our research we chose one of the best-documented complex net- 
works: citations to scientific papers. Here, the papers are nodes and citations 
to these papers are links. We performed high-statistics and time-resolved 
study of the citation dynamics of a very large and homogeneous set of pa- 
pers. In what follows we compare two methods of measuring the microscopic 
growth rate of this network: averaging (histogram) and cumulation. We found 
that the former method is quite reliable and yields superlinear attachment 
kernel, a w 1.25, while the latter method is prone to quantization errors. We 
came to conclusion that the microscopic growth mechanism of the citation 
network follows nonlinear autocatalytic growth (Eq. [3]) with a > 1. 

We elaborate on a dramatic consequence of nonlinearity: if one considers 
a citations dynamics governed by the superlinear attachment kernel, one is 
led to conclusion that this network contains a subset of the papers that will 
be cited forever. Thus we witness a dynamical phase transition in which 
citation lifetime of a paper diverges to infinity. Our measurements provide 
experimental evidence for such runaway papers that have practically infinite 
citation lifetime. 
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Fig. 1 Time dependence of the fraction of uncited papers Pq and of the mean 
number of citations m for 40195 Physics papers published in 1984. t is the number 
of years after publication, whereas the publication year corresponds to t — 1. The 
continuous lines are guide to the eye. While the number of uncited papers saturates 
after 15 years, the mean number of citations does not saturate even after 25 years. 



2 Methodology 

To assess the microscopic growth mechanism of the citation network we fo- 
cussed on one discipline- Physics. We considered a cohort of papers published 
in the same year Tpuu and measured the number of citations garnered by 
each paper in every subsequent year Tdt ■ To this end we used the Thomson- 
Reuters ISI Web of Science, chose 82 leading Physics journals, excluded 
review articles, comments, editorial, etc., and analyzed citation history of 
40,195 original research papers published in these journals in Tpuu = 1984. 
The cumulative citation distributions for this data set were demonstrated 
elsewhere [8]. Figure [1] shows some aggregate characteristics of this set: the 
mean number of citations and the fraction of uncited papers. 

In what follows we focus on two variables: (a) ki^t - the cumulative number 
of citations, i.e. the total number of citations accumulated by a paper i in the 
period between Tputi and Tdt', and (b) Aki^t+At - the number of additional 
citations gained by the same paper in a short time window between Tdt 
and Tat + At. Here, At ^ 1 year and t = Tpuu - Tdt + 1 (if Tputi = T^t 
then t = 1, in such a way that ki^i measures the number of citations during 
the year when the paper was published). Figure [2] shows Aki^t+i versus ki^f 
(Specifically, fc^^g is the number of citations garnered by a paper i from 1984 
to 1989 while Aki^i is the number of citations garnered by the same paper 
in 1990.) While the trend of increasing Aki^t+i versus fc^ t is clearly visible, 
the fluctuations are so strong that Fig. [2] does not provide an obvious proof 
of the validity of Eq. [TJ 
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Fig. 2 The scatter plot of the number of additional citations Aki^j, garnered by 
each paper i during seventh year after publication. The horizontal axis shows fei,6 
-the total number of citations garnered by the same paper during six previous years. 
The solid line displays approximation by Eq. |4]with a — 1.13, ko = 1, A — 0.065. 



This is not unexpected since the actual number of newly acquired cita- 
tions is a stochastic variable. We define Xi{t) = Akn/At which is the average 
citation rate over the ensemble of the nodes with the same ki^f The auto- 
catalytic growth model actually claims that = A{ki + fco)", in such a way 
that 



Ak.^t = A{h + kaTAt + adW{t) 



(4) 



where adWlt) is a random variable with zero mean and ct^ variance (for 
brevity we replace thereon t-\-l hy t). In contrast to Aki^t which is a discrete 
variable, Xi{t) is a continuous one. To verify whether the noisy data, such 
as those shown in Fig. [51 are generated by the growth law suggested by Eq. 
m there have been developed two methods: averaging (histogram) and cu- 
mulation. The processing of our data using these methods yielded conflicting 
results. In what follows we compare these two methods and develop a control 
tool to check their internal consistency. 
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Fig. 3 Mean number of additional citations, A(fc) — Aki, as a function of the 
number of previous citations k{t); t is the number of years after publication. The 
additional citations are counted in the time window of At = 1 year. To include 
uncited papers (k — 0) the horizontal axis displays k + 1 instead of k. Each set of 
points corresponds to a certain citing year. The straight dashed line shows linear 
approximation Ak oc {k + ko) where fco ~ 1- The data deviate upwards from this 
linear dependence, especially at f = 15 — 24. The continuous lines show better, 
superlinear fits, Ak = A{k + fco)" where A, a and fco are fitting parameters. The 
superlinear dependences fit the highly-cited papers (fc > 100) and uncited papers 
(fc — 0) as well. 

3 Comparison between different methods to measure the 
microscopic growth law of citation network 

3.1 Histogram (Averaging) Method 

To infer the microscopic growth law from the noisy data such as those shown 
in Fig. [21 one bins the data, finds the mean A = Aki for each bin, and 
compares the resuhing histogram to the prediction of Eq. 21 This approach 
was first used by Newman to verify the linear preferential attachment 
hypothesis in real networks. The Refs. [1711211122] followed this approach as 
well, while Refs. [HUH] used a very similar moving average procedure. 

To process our data in such a way we chose a certain citing year Tdt, 
grouped all papers into ~ 40 logarithmically-spaced bins, each bin containing 
the papers with close k, found the mean number of next year citations A for 
each bin and plotted it versus k. Figure [31 shows such A(fc) dependences. In 
particular, the black circles indicate the results of the averaging procedure 
applied to the data of the Fig. [2l The A(fc) dependences are fairly well fitted 
by Eq. H 
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Fig. 4 Time dependence of the parameters of Eq. U (a) Exponent a. (b) Initial 
attractivity fco- (c) Rate constant A. The continuous hne in (a) is a guide to the 
eye while in (c) it shows empirical approximation A = 3.3/ {t + 0.3)^ where t is 
the number of years after publica tion. The blue squares in (b) and (c) show the 
estimates of fco and A based on Eqs. l6l7l and Fig.[T] correspondingly. The consistency 
of ko and A obtained by two methods [circles vs squares] validates the superlinear 
preferential attachment, a > 1. 



Figure m shows time dependence of the fitting parameters a,kQ,A. The 
exponent a gradually increases with time from a = 1 to a = 1.28, indicat- 
ing linear attachment kernel for "young" papers and superlinear attachment 
kernel for "old" papers. The initial attractiveness fco ~ 1-1 is almost time- 
independent and is surprisingly close to ad hoc assumption of de SoUa Price 
[T]. The time dependence of the attachment rate A follows the empirical 
power-law dependence, A — 3.3/{t + 0.3)^. A similar power-law dependence 
can be inferred from the US patent citation data of Ref. [17] . 
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3.2 Cumulation Method 

Jeong, Neda, and Barabasi [TB| were the first to measure the growth rate of 
evolving networks used the cumulation method. This method quickly became 
the most popular tool to assess the preferential attachment in real networks 
[HlHIIgllllllllgil. It consists in calculation of the kernel K(fc) = Ak'dk' 
where k' is the total number of citations garnered by a paper by year Tdt 
and Ak' is the number of citations accrued by this paper during time window 
between Tdt and T^t + At where At is usually 1 year. The integration is 
performed over all papers that garnered k' < k citations by year Tdt- The 
key assumption behind this scheme is that the fluctuations in Ak are averaged 
out and the resulting integral is the same as if Eq.|4]were integrated directly 
over k at fixed t i.e., 

^ik)-^[{k + kor+'-k^+'] (5) 

Figure [5] demonstrates application of this method to our data. The fluc- 
tuations have been dramatically reduced, as expected. Equation [5] fits well 
the data for high k, while for low k the fit is less satisfactory. Figure [B] shows 
the fitting parameters. We found that the deviation of the exponent a from 
unity is within the experimental uncertainty, Aa — ±0.05. Therefore, to find 
A and ko we set a = 1 in our fitting procedure. 

Figure ini shows that the fitting parameters found in such a way are notably 
different from those found by the averaging method (Fig.|4]). Most important 
- the exponent a is close to unity while that found from the averaging method 
is higher than unity. The initial attractivity kg is high and increases with time, 
while that found from the averaging method is close to unity and almost time- 
independent. The attachment rate A exceeds that found from the averaging 
method, especially at long times. The discrepancy between the two methods 
calls for some control tool. In what follows we develop such tool and use it 
to decide which method: averaging or cumulation is more reliable. 



3.3 Control Tool 

We consider here an additional tool to estimate the microscopic growth pa- 
rameters of a growing network. We have developed this indirect method to 
check the internal consistency of the histogram and cumulation methods. 
This control method is based on two assumptions: (i) the microscopic growth 
law given by Eq. |3]is valid for all papers including uncited ones, and (ii) the 
exponent a is known. Then, the microscopic parameters A and ko may be es- 
timated from the dynamics of the macroscopic parameters: the mean number 
of citations m, and the fraction of uncited papers Pq- 

The mean, m — ki{t), is the average number of citations garnered by 
a paper during the period between Tpubi and Tdt, in such a way that the 
age is t = Tdt — 2p«b/ + 1- The averaging here is performed over all papers. 
Differentiation with respect to time yields the average number of additional 
citations garnered by a paper between t and t + At, namely, ^At = Aki. 
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Fig. 5 Integrated number of additional citations «:(fc) = Ak'dk' , as a function 
of the number of previous citations k; t is the number of years after publication. We 
used the same raw data as those shown in Fig. [2] and applied trapezoidal numerical 
integration routine implemented by MATLAB. The dashed line shows quadratic 
dependence k cc k^ as expected for the linear preferential attachment. The data at 
high k follow this dependence as if they were generated by the linear preferential 
attachment, a « 1. The continuous lines show fit given by Eq. [5]with a — 1 and A 
and ko as fitting parameters. 



We average Eq. [Dover all papers and for Z\t = 1 we find ^ = A{k + ko)" « 
A(m + kg)" (the last approximation holds because a is close to unity). This 
yields the rate constant 

dm 1 . . 

(6) 

dt (m + fco)° ^ ' 

For fc = Eq. |3] reduces to Aq — Ak^ where Aq is the average citation rate 
of previously uncited papers. The latter can be recast through the fraction 
of uncited papers Po as follows, Aq ~ "p"^t^ [IS]- Equation [5] yields then 

We solve Eqs. I6I7I for known a and find A and ko. If a is found properly 
the parameters A and ko obtained by this control method shall be consistent 
with those found directly. 

For the histogram (averaging) method, the A and ko found from Eqs. 
I6I7I are indeed consistent with those found by the direct procedure (Fig. H]). 
The difference in ko is within the measurement uncertainty, while a small 
difference in A can be traced to the Jensen's inequality, a:" > for a > 1. 
However, for the cumulation procedure, the A and ko found from Eas. l6l7l are 
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Fig. 6 Time dependence of the parameters of Eq. [4] as found using numerical 
integration (Eq.lSJ. (a) Exponent a. (b) Initial attractivity fco- (c) The rate constant 
A. The blue squares in (b) and (c) show, correspondingly, the estimates of ko and 
A based on Eqs. [6l [7] and the data of the Fig. [S] The resuhs obtained by two 
methods [circles vs squares] strongly differ. This casts doubt on the validity of the 
cumulation method as applied to citations and especially on its claim of the linear 
preferential attachment, a — 1. 



clearly inconsistent with those found directly (Fig. [5]): A is substantially lower 
and fco is also much smaller than those found directly. This inconsistency calls 
for a deeper consideration of the validity of the cumulation method as applied 
to citations. 

While the cumulation method works well for noisy continuous data with 
Gaussian fluctuations, its applicability to citations is problematic. Since the 
additional citations Ak are discrete and non-negative, their fluctuations around 
the mean are non-symmetrical and their magnitude is on the order of the 
mean (see Fig. [2]). It appears that the standard numerical integration proce- 
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Fig. 7 The annual number of published Physics research papers (not including 
conference proceedings). The solid line shows exponential approximation corre- 
sponding to 1.3% annual growth. 



dure as implemented, for example, in MATLAB does not work well for dis- 
crete, wildly fluctuating data that have non-symmetrical distribution around 
the mean. The straightforward application of the numerical integration pro- 
cedure (cumulation) for quantifying dynamics of growing networks is thus 
ineffective. This method shall be specially tailored for the discrete variables 
with the non-Gaussian and strongly skewed fluctuation spectrum. 

We conclude that the microscopic parameters of the growing citation 
network obtained by the averaging (histogram) method are most reliable. 



4 The effect of the exponential growth of publications on citation 
dynamics 

Most theoretical studies consider networks that grow linearly in time. In 
fact, they define a "network time" in such a way that new nodes are added 
to network at constant rate. It should be noted that citation networks grow 
exponentially with time. In what follows we define the network time for the 
citation network and recast our results in terms of the network time. 

Figure [7] shows the annual growth of the number of original research 
papers and reviews in 82 leading Physical journals as covered by the ISI 
Thomson-Reuters Web of Science (excluding editorials, conference proceed- 
ings, etc.). The network growth is close to exponential, hence the difference 
between the physical time and network time is essential. 

We define the " network year" At* in such a way that the number of papers 
published during this time interval is equal to the number of papers published 
in 1984. (This is approximately equivalent to time rescaling, t* = ^ ^ 032~^ 
where t is the physical time.) To determine microscopic parameters of the 
citation dynamics we use Eq. 2] where At — 1 year is replaced by At* . The 
data shown in Fig.[3]are unaffected by this transformation although now they 
refer to network time which is different from physical time. The parameters 
a and fco remain the same. The new rate constant A* is determined from the 
slope of the dependences shown in Fig. |3] divided by At* instead of At. 
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Fig. 8 Microscopic parameters of the citation dynamics versus network time t* . 
The latter is defined in such a way that the citation network grows linearly with t* . 
(a) m, the mean number of citations, (b) a, exponent of the attachment kernel, (c) 
A*, the rate constant. The continuous line shows empirical approximation A* = 
3.8/(t* +0.5)^ 



Figure [5] shows some citation parameters in this time frame. The sat- 
uration exhibited by m and a is clearly visible. This should be compared 
to Figsdl Hh- where the growth of m and a is more close to logarithmical. 
The attachment rate A*{t*) turns out to be almost the same as the A{t) 
dependence. This coincidence may be occasional. 

We conclude that the growth of the citation network follows Eq. 2] with 
time-independent fco ~ 1, time-dependent exponent a which gradually in- 
creases from 1 to 1.28, and the rate constant that decays with time as ~ 
This conclusion remains the same if we replace the physical time by the net- 
work time. By the way, the studies of US patent-to-patent citations by the 
histogram method yielded similar attachment exponent a ~ 1.2 — 1.27 p!51[T71 
[21] . All this indicates that the citation networks experience the autocatalytic 
growth with the superlinear attachment kernel. Although the nonlinearity is 
weak, it leads to far-reaching consequences which we analyze below. 
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5 Divergence of the citation lifetime - dynamical phase transition 
towards immortality 

In what follows we analyze the consequences of the nonlinear growth mecha- 
nism of the citation network. In particular, we demonstrate that the nonlin- 
earity is responsible for the enormous spread of citation lifetimes of scientific 
papers. To show this we consider citation dynamics of individual papers and 
distinguish between the initial period of to =2-3 years when a paper makes 
an immediate impact and the subsequent period when the citation dynamics 
of this paper to some extent is built on its initial success. The time depen- 
dence of the total citation count of a paper during this later period can be 
crudely estimated by integrating Eq. |4] with respect to time as if k were a 
continuous variable. In what follows we focus only on the papers with k » 1 
(moderately- and highly-cited papers), in such a way that the term fcg in Eq. 
|3]can be neglected. We also neglect for a moment the stochastic component 
of k. Assuming time- independent exponent a we integrate Eq. |4]and find 



Here, ko = k{to) stands for the number of citations garnered by a paper 
during initial period to (it shouldn't be mixed with ko that appears in Eq. 
E]) , and 5 = a — 1. For S << 1 Eq. |8] reduces to a more transparent form 



Consider two cases. 

1. S — -linear growth. The k{t) dependence can be factorized, k{t) = 

fcoe'^'o , where the feg sets the scale and the factor e-^'o sets the time 
dependence of the citation count. Citation dynamics of all papers should 

follow the universal dependence e-^'o which is independent of fco. 

2. (5 7^ -nonlinear growth mechanism. In this case Eq. [5] can not be fac- 
torized and both the scale and the time dependence of the total citation 
count depend on fco. 

Basing on these considerations we provide the experimental evidence for 
the nonlinear growth by analyzing various measures of the paper longevity. 
One of them is a nonparametric measure introduced by Redner [3], namely, 
"citation age" < t >. It is defined with respect to the paper age t which is 
the number of years that passed after its publication. The citation age of the 
paper is nothing else but the mean age of the papers that cite it. 





k{t) — fege 




(9) 



< U >= 



Jq tidki 

h{t) 



(10) 



In the extreme case when the citations grow linearly in time, < ti >= t/2. 
Citation age exceeding t/2 indicates accelerating growth while citation age 
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below t/2 indicates some kind of saturation. Figure [9] shows citation age 
of the papers for t = 25 years. We observe that < t > increases with the 
number of citations as expected for the nonhnear growth mechanism, and 
eventually achieves the critical value of t/2 = 12.5 years. This means that 
there is appreciable number of papers whose citation dynamics didn't come 
to saturation and they are actively cited even 25 years after publication! 

Another way to illustrate such exceptional behavior is to approximate 
Eq. [5] by the exponential dependence 

m = I [l - e-^(-^)] (11) 

where K is some scale factor, Z\ is a (small) delay between the publication of 
the paper and the onset of citations, and /? is the rate. The latter is negative 
when k{t) accelerates with time and positive when k{t) comes to saturation. 
In this latter case r = 1//3 has the meaning of citation lifetime. It is related 
to citation age (Eq. [TTO as follows: for the exponential dynamics and in the 
long-time limit r + ~< i >. 

We measured k{t) for all papers in our dataset, approximated it using 
Eq. [TT] and found microscopic parameters /3 and A. Since these microscopic 
parameters strongly fluctuate, we binned all data into 40 logarithmically- 
spaced bins and considered the average over the papers in each bin. The 
results are shown in Fig. [HI The rate fi changes sign and becomes positive for 
highly-cited papers indicating accelerating citation dynamics. This change 
of sign occurs at the same threshold where citation age becomes equal to 
t/2 (Eq. [T0|) . The citation lifetime is r ~ 5 — 6 years for low-cited papers, 
for moderately- and highly-cited papers citation lifetime increases and even 
diverges, as it is predicted by Eq. [S] 

The divergence of the citation lifetime is a direct consequence of the 
nonlinear autocatalytic growth and it demonstrates the tendency of citation 
network to develop a few hubs that attract the majority of citations. In the 
cumulative citation distribution these hubs appear as "runaways" [8]. These 
most highly-cited papers have all chances to achieve infinite citation lifetime. 
This observation extends the well-known adage "the rich get richer" to "the 
rich live longer" . 



6 Discussion 



6.1 Why is the growth mechanism of citation networks so close to linear? 

When viewed from the perspective of network dynamics, the preferential at- 
tachment mechanism does not favor any particular value of the attachment 
exponent. From such perspective the ubiquity of linear or nearly linear pref- 
erential attachment is enigmatic. However, if we consider network dynamics 
from the perspective of a single node, the ubiquity of nearly linear preferential 
attachment appears naturally. 

Indeed the linear preferential attachment in the context of citations means 
that citation dynamics of the papers published in the same year has the same 
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Fig. 9 Citation age < t > (Eq. fTD)) and citation lifetime r (Eq. fTTj) versus k{t = 25)- 
the number of citations garnered by a paper after 25 years. The data were binned 
and each data point represents the average over one bin. Note divergence of r at 
the threshold of k{t) « 600. The inset shows citation rate P — 1/t which changes 
its sign and becomes positive for k{t) > 600. The growth of < t > and r with k is 
a signature of the nonlinear autocatalytic process. 



functional dependence and differs only in scale (we totally neglect here the 
stochastic character of the citation process). The difference between total 
number of citations of these papers is due to initial conditions, namely the 
number of citations that the papers garnered during first 2-3 years after 
publication. This is related to the number of readers which is determined by 
the journal's circulation. Since the majority of readers are graduate students 
who tend to copy once prepared reference list in all their publications, the 
citation lifetime of a paper that some Ph.D. student came across, is the 
duration of his Ph. D. stay, namely 3-5 years. Therefore, the initial impact 
of a paper on research groups that undertook to cite it, usually continues for 
1-2 generations of the Ph. D. students, namely for 6-10 years (see Fig. IH]). 

If the above scenario were true for all papers, then the growth of the 
citation network would follow linear preferential attachment and the citation 
lifetime of all papers would be more or less the same. Figure [9] shows that 
while the citation lifetime of the vast majority of papers is indeed 6-10 years, 
there are quite a few papers that have much longer lifetime. We believe that 
these are the papers that induce "chain" reaction or cascade. Namely, the 
researchers can pick up such paper not by reading the journal where it was 
published but through the impact of this paper on other research groups. In 
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this case the paper starts it citation career in a new research group and its 
citation hfetime increases by another 6-10 years. Such process of spreading 
the ideas is similar to epidemiological process [51115^ and to the redirection 
(copying) mechanism [30,33^. It seems that the papers whose impact propa- 
gates through the cascade process are responsible for the nonlinear growth of 
the citation network. The fraction of such papers in the whole pool of papers 
determines the degree of deviation of the attachment exponent from unity. 
The fact that this deviation is small, indicates that only a small fraction of 
papers ignites the chain reaction or cascade. Our measurements indicate 
that these are the papers that garnered at least ^ 50 — 70 citations at some 
moment in their citation career. 

This cascade mechanism is specific for the citation network and it does 
not necessarily occurs in other networks. Therefore, the growth mechanism 
of the complex networks other than citation network (Table I) can still follow 
linear preferential attachment. 



7 Conclusions 

The dynamics of citation network is driven by the nonlinear autocatalytic 
growth with the attachment exponent a 1.2 — 1.3. The small but appre- 
ciable deviation of the growth process from linearity leads to a dramatic 
dynamical phase transition: the papers that exceed at some stage a certain 
number of citations become practically immortal: their citation lifetime di- 
verges. In the language of epidemiology these papers become endemic. 
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